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INTROiJUCTxON 2 

NLS provides a variety of coamands for file manipulation and 
viewing, kiditing coramands allow tne user to insert and cftange tne 
text in a file. Viewing coraiuands (viewspeca) allow the user to 
control how the systew prints or displays tne file. Line 
truncation and control of statement numbers are examples of these 
viewing facilities. ^a 

Occasionally one may need more sophisticated view controls than 

those available with the viewspec and viewchange features in n-ls. '^o 

For example, one way want to see only those statements that 

contain a particular word or pnrase. 2d1 

or one might want to see one line of text that compacts tne 
information found in several longer statements, <iD2 

One might also wish to perform a series of routine editing 

operations without specifying each of tae iHS commanas over and 

over again, dc 

User-written programs way tailor tne presentation of the 

information in a file to particular needs, iixperienced users may 

write programs taat edit files automatically, ca 

User-written programs currently must oe coded m amc*s 
procedure-oriented programming language, LIO, NLS itself is coded 
in LIO. LIO is a high-level language which must oe compiled into 
machine-readable instructions, de 

This document describes three general types of programs; 
— simple filters that control what is poTKAYEij ON 

THE USER'S teletype or display {Parts one and Two), 
--programs that may modify the statements as they 

decide whether to print them (Parts Two and Three), 
—those that, like commands, are explicitly given 

control of the joo and interact with the user (Part Four), dx 

User programs that control what material is portrayed taKe 

effect when NLS presents a sequence of statements in response 

to a cowmand like Print (or .Jump m DNLS), dil 

in processing such a command, i^L3 looks at a sequence of 
statements, examining each statement to see if it satisfies 
tne viewspecs then in force. At tnis point tua may pass tne 
statement \,o sl user-written program to see if it satisfies 
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the requirenents specified in that program, ix tne user 
prograw re\»urns n value of Ti<Uiji, tiie (passed) statement is 
printed and the next statement in the sequence is tested; if 
FALSii, NL3 just goes oii to tne next bt^tewent, «iila 

While the program is examining the statement to decide whether 

or not to print it# it may modify tae contents of the 

statement, such a program can do anything the user can do witn 

NL5 commanJs, 2f2 

For more complicated tasKs. control may be passed explicitly to 
the program, in this case, a user program appears as a 
special-purpose subsystem having (in addition to the supervisor 
commands) one or more commands, once such a program is loaded, 
it can be used Just like any of the standard suosystems, (ihe 
HjiiSSACirE program is an example.) 21J 

This document describes the 110 programming language used at akc. dg 

Part one is intendea for the general user. 2gl 

It is a primer on Content Analyzer patterns. This does not 
involve learning the LIO language nor programming. This 
section can stand alone, and tne general (if somewhat 
experienced) NiS user ahouid find it useful. iigla 

Part TWO is intended for the Beginning programmer. 2g2 

It presents a hasty overview of Lio programming, with enough 

tools to write simple programs. This is intended as an 

introduction for the beginning llO programmer, who we assume 

is reasonably familiar with tiLS (its commands, subsystems, 

and capabilities) and has some aptitude for programming. 2g2a 

Parts Three and Four are not included in this document. They are 

presently being updated, i^ou can read these sections online oy 

jumoing to the link <userguides,L10-c»uide,>. When completed: 2n 

Part Three win include a more complete presentation of LIO. 2nl 

It is intended to acquaint a potential LIO programmer with 
enough of the language and HIS environment to satisfy most 
requirements for automated editing programs, dany of the 
concepts in i-art Two are repeated in Part Three so that it 
may stand alone as an intermediate programmer's reference 
guide. itii^ 
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Part i^our wilx present more aavancea LIQ toola ana an 

introduction lo Cdl. allowing command synx*ax specification. 2n2 

TJiis siiouiu give the programmer t^^e aoility to write 

programs wuich work across files, whica move through flies 

in other than the standard sequential order, and wnich 

interact with the user, 2md 

We suggest that those wno are new to LIO oegin with section 1 

and read this document one section at a time, pausing between 

sections to try out the concepts presented by actually writins 

patterns or programs that put the new ide*s to experimental 

use. Hands-on experience is of ax» least as much value as tni3 

tutorial. If you have problems at any point, you should get 

help from ARC before proceeding to the next section. 2hJ 

rtore complete documentation can be found in (70i>2,l). for 

examples of user programs which serve a variety of needs, consult 

the User Programs Library iable of Contents 

(programs, -contents,!) , For information about commands mentioned, 

asK for the programming subsystem with the NLS Help command, i'nis 

document is available online in <userruides,L10-guide,>. kx 
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PnRT Qlii>: Content analyzer Patterns J 

iiection i: lutroauction ja 

Content analysis p^ttterns cannot aifect tne lormat oi a statement, 
nor can tney edit a liie. They can only aeteriiiine wuether a 
statement should be printed at all. They are, in a sense, a 
filter tnrough which you may view the file. More complex tasKs 
can oe accompiisnea through programs, as aescrioed later in tnis 
document, ^al 

The Content Analyser filter is created by typing in (or selecting 

from the text in a file) a strin? of a special form. This string 

IS caliea the "Content Analyzer Pattern", Lach statement is 

cnecKed against the pattern before it is printed; only statements 

that are described by the pattern will be printed. Ja^J 

Some QUICK examples of Content Analyzer Patterns: j<i.J 

' ( isLC • ) will show ail statements whose first 

character i& an open parenthesis, then i^ny 

number of letters or digits, then a close 

parenthesis, ja^a 

/■"biap"y will show all statements with the 

string "blap" sonewhere in tnem, Jsljo 

SINCE (3-JUN-73 00:00) will show all statements 

edited since June 3, 1:^73 ^a^c 

The next part of this section win describe the elements which 
raaKe up Content Analyzer Patterns, followed by some examples, Tne 
final suDject of tnis section is how to put them to use. jsm. 
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Section 2; Patterns 



jo 



Elements o£ Content Analyzer Patterns Jdjl 

Content Analyzer Patterns describe certain things the system 

must check beiore printing a statement, it may checK one or a 

series of things. The content Analyzer searches a statement 

from the beginning, character oy character, for described 

elements, as it encounters each element of tne pattern, tne 

content Analyzer checks the statement for the occurrence of 

that pattern; if the test fails, the whole statement is failea 

(unless there was an "or" condition, as described later) and 

not printed; if the test is passed, an imaginary marker moves 

on to the next character in the statement, and the next test in 

the pattern is considered. joxa 

The pattern may include any sequence of tne folloviing elements, 

the Content Analyzer moves the marKer through the statement 

checking for each eiement of the Pattern in turn: ^oio 



Literal Strings 

•c the given character (e.g. a lower case c) 

"string" the given string (may include 

non-printing characters, such as spaces) 
Character classes 

OH any character 

L lowercase or uppercase letter 

D ' digit 

UL uppercase letter 

LL lowercase letter 

ULD uppercase letter, or digit 

LLD lowercase letter, or digit 

LD lowercase or uppercase letter, or digit 

NLD not a letter nor digit 

PT any printing character 

wp any non-printing character (e.g. space) 
special Characters 

SP a space 

TAB tab Character 

CR a carriage return 

LF line feed character 

lOL TEKEX EOL Character 

ALl altiiiude character 

Special elements 

ENDCHK beginning and end of every 

statement; can't scan past it 



^bic 



Jbia 



^Die 



joit 
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TRUE 



ID= id 
IDf!^ id 



by user wnose 



Dy user wnose 



IS true witnout ctiecKing anytning 
m statement 
Statement createu 
ident is given 
statement not created 
ident is given 
statement editea cex'ore given date and time 
statement euitea since given aate anci time 
e.g. BEFORE (1 OCT I9'fk 00;QOi ; 
The date and time must both appear, in tiie parentheses. 
It accepts almost any reasonable date and time syntax. 
Examples oi valid dates: 

17 APRIJL Ik 

17/5/1974 

>/17/74 



ijEFORE (a-t) 
SINCE (d-t) 



1231i;56 

l:36AM 

1200N00N 



17-APR-Vl; 

APR-17-71t 

APR 17 Vli 

APRIL 17# I97k 
Examples oi valid times 

1:12:1^ 

123U 

1S56-EST 

16:iO (];:30 PM) 

12:00:00AH (midnignt) 

li:S9J59AM-ESX (late morning) 

12:00;CiAM (early morning) 
Scan direction 

< set scan direction to the xeit 

> set scan direction to the right 

The default, re-initialized for eacn new statement, is 
scan to the right. 

Combining Elements 



^oig 



3D2 



These elements may be combined in any order, spaces within the 
pattern are ignored (except in literal strings) so they may be 
used to make reading easier for you. Several operators can 
modify the elements j Jb2a 



NUMBER -- multiple occurrences 

A number preceding any element other than one of the 
"Special elements" means that the test will succeed only ii 
it finds exactly that many occurrences of the element* If 
there aren't that many, the statement will be rejected. 
Even though there may be more, it will stop after that many 
ana go on to check tne next element in the pattern. 

3UL means three upper case letters 



Jb2o 
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$ — range of occurrences 

A dollar sign i^t) preceaing an> element otiier tftan the 
"special elements" means "any number oj occurrences of"» 
This may include zero occurrences, 

$'- ineaijs any number cf aasnes 

A number in iront of the dollar sign sets a lower liwit. 
3^r neans three or iaore digits 

A number aiter the dollar sign sets an upper limit for the 
search. It will stop after that nuwoer and then cnecK tor 
the next element in the pattern, even if it could nave founu 
more, 

S3LD means from zero to three letters or GigiLs 
5v>7PT neans from j> to 7 (inclusive) printing 
characters 

[J -- floating scan 

To do other than a character ty character checK, you may 
enclose an element or aeries of elements in square bracicets 
[J, The Content Analyzer will scan a statement until the 
element is found • (If the element is not in square 
brackets, the wnole statement fails if the very next 
character or string fails the test of the next element.) 
This test will reject uhe statement if it can't find tne 
element anywhere in the statement, if it succeeds, it will 
leave the marker for the next test just after the string 
satisfying the contents of the square oracKets. 

"start" means checK to see if the statement 
begins with the string "start" (or, 
if it is in the middle cf a pattern, 
checK the next 5 characters to see 
if they are start). 

/"" start "y means scan until it finds tne 

string s t a r t» 
13DJ means scan until it finds 

three digits. 
/ 3D *iJ means scan until it finds three 

digits followed t)y a colon 



J02C 



Jadd 



-- negation 



jo'de 
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If an element is preceded by a minus sign -, tne statensent 
will pass that test if the eieraent does not occur. 

-Lij means anything other than a letter 
or digit, such as punctuation, 
invisibles, etc. 

You may put together any number of any of these to form a 

pattern, 3b2f 

e.g. 1$PT [''.mo}'* 1$d; -SP 

Logic in Patterns joj 

More sophisticated Patterns can by written t>y using tne logic 
features of LiO. Generally, an expression is executed left tc 
right. The following operations are done in the given order: 



/ 

NOT 

AND 

OR iD^a 

JQJD 

parentheses (and square brackets for floating scans) may be 
used to group elements, it is good practice to use 
parentheses lioerally. 



JOJQ 



/ means "either or"; the element win be true if either 
eieraent is true. 

{3D L / kli) means either three digits and a letter 
cr four digits. 

sometimes you may want want the scan to pass your marker 
over something if it happens to be there (an optional 
element). "TRUii" is true without testing the statement, if 
the other x,ests fail, the imaginary marker is not moved. 



{D / TRUE) 



looks for a digit and passes the 
imaginary marker over it. if the 
next character is not a digit it 
will just go en to the next test 
element in the pattern without moving 
the marker. Xhis test always passes. 
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i*e. It is used to scan past something (s) wnicn may or 
may not be tnere. 

Since expression^ are executed from iex^t to right, it aoes 
no good to nave TRUE as the tirst option. Uf it is first, 
the test Villi imwediateiy pass without trying to scan over 
any elements.) 



NOT 



JOJQ 



NOT will be TRUE if the element or group of elements 
enclosed in parentheses following the i^OT is false. 

NOT IJj Will pass if the nexx. character is neither 
a letter nor a digit, 

since the siasn is executea first, not u / *h win be true 
if the rexx, character is KEiTHhjR a digit nor the letter "h" 
It is the same as not (i)/'h). 



AND 



Jbje 



OR 



ANi; means both of the two separatee groups of elements must 
be true for the statement to pass, 

SINCE (3/6/73 CO!UO) Awij XDf^NDM means statements 

written since March 6, 1973 by 
someone other than NDM. 



OR means the test will be true if either oi the separated 
elements is true. It does the same thing as slash, but 
after "AND" and "NOT" have been executed, allowing greater 
flexibility. 

D AND LID OR UL means the same as iD Aj4D ILD) OR UL 
AND LID / UL means the same as D AND (LLD / UL) 

While such patterns are correct ano succinct, parentheses 
make for much clearer patterns. £.lements within 
parentheses are taken as a group; tne group will be true 
only if the statement passes all the requirements of the 
group. It is a good idea to use parentheses whenever 
there might be any ambiguity. 



joji 
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Section 3: Examples of content Analyzer Patterns jc 

D 2$LD / r*CA'7 / r'content Analyzer"; , 3cx 

Tnis pattern will match any of tftree types of stateraente: those 
beginning with a nuwericai digit followed by at least two 
Characters which way be either letters or digits, ana 
statements with either the patterns "CA" or "Content Analyzer" 
anywhere in tae statement, Jcia 

Note the use of the square cracKets to permit a floating 
scan -- a search for a pattern anywhere in the statement. 
Note also the use of the slash for alternatives, 

BiiFORE (25*JAN-7it 12:00) 3C-i 

This pattern will match those statements created or moaifiea 

before noon on 25 January 1972. Jc^a 

(ID » HQl) OK {ID ' NDW) 3Ci 

This pattern will match all statements created or modified oy 

users with the identifiers "HGL" or "NDM". jcjj* 

[{21 (SP/TRUE) / 2D) C '- kl^J 3C4 

This pattern will match characters in the form of phone numbers 
anywhere in a statement. Numbers matched may nave an 
alphabetic exchange followed by an optional space (note tae use 
of the TRUE construction to accomplish this) or a numerical 
exchange. jc^a 

Examples include DA 6-6200, CA6-6200, ana 326-6200. 

riiNDCKR; < "cba" 3C> 

This Will pass those statements ending with "aoc". I^ will go 

to the end of the statement, change the scan direction to left, 

and Check for the characters "coa". Note that since you are 

scanning backwards, to find "abc" you must iook for "cba". 

Since the "cba" is not enclosed in square brackets, it must be 

the very last characters in tne statement. jcj?* 
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Section ki Using the Content Analyzer ju 

Content Analyzer paX/terns may be entered in two v»ays; joi 

1) From the BASE subsystem, use the comiaand: jdxa 
Set Content (pattern) To pATTEhN OK 

2) From the PHOGPAMS suosystem, use the command: jaio 

Compile Content (pattern) PAIT£RN QK 

OK raeans "Command Accept", a control-D or, 
in TNLS (by default) a carriage return. 

In either case; 3CL^ 

1) patterns may be typed in from the Keyboard, or jdisa 

2) they may be addressed from a file, Ja2D 

In this case, tne pattern will be read from the first 
Character addressed and continue until it finds a semicolon 
(;) so you must put a semicolon at the end of the pattern 
(in the file) . 

Viewspec j n^ust be on (i.e. Content Analyzer off) when entering 

a pattern. ^d2c 

Entering a Content Analyzer Pattern itutomaticaiiy does two things s 3q3 

1) compiles a small user program from tne characters in tne 
pattern, and ^aja 

2) takes that program and "institutes" it as the current 
content Analyzer filter program, deinstituting any previous 
pattern. jqjo 

"Instituting" a program means selecting it as the one to 
take effect when the Content Analyzer is turneu on. Jfou may 
have more than one program compiled but only one instituted. 

when a pattern is deinstituted, it still exists in your 
program buffer space and may be instituted again at any time 
with the command in the PRUGitAi'lS subsystem: 

Institute Program PROGRaM-NAMH. (as) content (analyzer) OK 
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The programs may oe reiered to oy number instead oi 
name. They are numbered sequentiaxiy, tiie iirst 
entered oeing number i. 

All the programs you have compiled and the one you have 
instituted may be listed ^itn the commano in the programs 
subsystem: 

Shew Status (of programs cuiferi OK 

Programs may buixd up in your program buffer. To clear the 
program buffer, use the PROGRAMS subsystem comroana: 

Delete All (programs in buffer) OK 

We recommend that you do this before each new pattern, 
unless you specifically want to preaerve previous 
patterns. 

To invoke the content Analyzer: jqh, 

When viewspec i is on, the instituted Content Analyzer program 
(if any) wm check every statement before it is printed (or 
displayed) • jai^A 

If a statement does not pass all of the requirements of the 
Content Analyzer program, it will not be printed. 

In DNLS, if no statements from the top of the screen on 
pass the Content Anaiyzert the word "brnpty" wiil be 
displayed. 

Note: You will not see the normal structure since one 
statement may pass the Content Analyzer although its source 
does not. Viewspec m (statement numbers on) will help you 
determine the position of the statement in the file. 

When viewspec k is on, the instituted Content Analyzer search 
program win check until it finds one statement that passes tne 
requirements of the pattern. Then, the rest of the output 
(branch, piex, display screen, etc.) will be printed without 
Checking the Content Analyzer. 3Q4b 

When viewspec J is on, no Content Analyzer searching is done. 

This is the default state; every statement in the output 

(branch, piex, display screen, etc.) will be printed. Note 

that i, j, and K are mutually exclusive. jakc 
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Notes on the use of Content Analyzer ixltere; 3a> 

some H16 commanas are always afiectetl by tne current viewspecs 
(including i,j, or K) : ^ci;?a 

output 

jurrjp (in DNLi?) 

Frlnt (in xNiS) 

Most NLb conKanas ij^nore the Content Analyzer in their eciiting. 
The following BASS: autsystem coKnanus oxier the option of 
specifying viewspecs, or "Filters", (which may turn on the 
content Analyzer) which apply only for the purpose of that one 
command and affect what statements the command works on: Jdi^o 

Copy 

Delete 

Move 

SUDstitute 

At this point, i\. would be wise to practice until you becowe 

proficient at Content Analyzer patterns. You wight begin by 

trying to use some of the patterns given in the above examples, 

and then try writing a few patterns of your own. These patterns 

are both a usefux iUS tool and a basic component of many LiO 

programs. Jdb 
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PART TV^O; introducticn to LIO Programming 4 

Section i: Content Analyzer Programs 4a 

Introduction 4ai 

Wnen you specify a Content Analyzer Pattern, the PHOGRAWS 

subsystem constructs a program which looks for tne pattern m 

each statement and only displays the statement if the pattern 

matching succeeds. You can gain more control and do more 

things If you buiid the program yourself. The program will De 

used just like the simple pattern program and has many of tne 

same limitations, programs are written in nLS :)ust like any 

other text file. They then can be convertea to executable coae 

by a compiler. This coae resides (or is loaded) in your 

programs ouffer space; it can be instituted as the current 

content Analyzer filter program like a content Analyzer 

Pattern. 4aia 

Program structure 4aki 

If you specify a content analyzer Pattern, wls compiles a small 

program that looks like tnis (with the word "pattern" standing 

for whatever you typed in) : 4a2a 

PHUGKAH name 

(name) PROCfiCURE; 

IF FXNU pattern THEN K£TURN(TRUi^ fcLSE i<£iTUkN (FALSb) , 

END. 

FINISH 

All LIO prcgraMs must begin with a header statement, the word 

PROGRAM (all caps) followed by the name of the first procedure 

to be executed (all lower-case). This name is also the name of 

the program. If the program is being compiled into a file ito 

be described at the end of this section), the wcru FILjl snouid 

be substituteu for the word PHOGRiiH. 4a^ d 

iii.g. PROGRAM first 
FILE deldir 
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(Note: the Content Analyzer maKes up a program name 
consisting q± up^^Ixxxxx , where 

# ia a sequential namber, the firat pattern being numoer 
one, anu 

xxxxx is the first live characters of your pattern.) 

The body of a program consists cf a series of bhCLARAnOi^ 
statements ana PPOCEDUR&S (in any orcSer) • in tae atove case, 
the prograi!! consisted of only one small procedure anci no 
declarations, when the program is loadea into your programs 
buffer space, the declarations reserve spdce in tne systeia to 
store information (variables), i^hen the program is used as a 
Content Analyzer filter program, the firs;, procedure is called 
for each statement. It may in turn call other procedures and 
access variables in the program or in the NLS system. i^a'^ic 

e.g. DECLARE X, y, z (descrioed belovi) 
(first) PFQCEDURiL; 
♦ • • 

The end of the program is delimited by the word "FiNiSh" (m 

ail upper case). iiak^a 

Comments may be enclosed in percent signs ()•) anywhere m tne 

program, even in the middle of Lio statements. The LxO 

compiler wm ignore them. 4aie 

Except within literal strings, variable names and special L-lO 

words, spaces are Ignored, It is rood practice to use tiiem 

liberally so that yuur program will be easy to read. Also, Nl^ 

file structure is ignored. Structure is, however, very 

vai viable in making the program readable, and it is good 

practice to use it m close correlation to the program's 

logical structure. For instance, the programmer usually maKes 

each Of the elements of a program (declarations, procedures, 

and FINXSh) separate statements, below the header statement in 

file structure. Xhis point wiii be discussed further later. aadi 

So far, we have file which looks something like: 4a2g 

i^HUGHAH name± 

DbCLAftE ... ; 

Q&ClAHE ... ; 
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(namel) PHOCEDURE ; 

(narae2) PHOCEDURE ; 

FINISH 

Procedure Structure 4aJ 

Each procedure must begin witn its header statement. This 

header statement is a name enclosed in parentheses followed by 

the word PROCEDURE^ and terminated by a semxcoion. 4a^a 

e.g. (name) PROWECURE ; 

The body of the procedure may consist of Local declarations^ 

then LlO atatementii. An Lio statement is any program 

instructiori terminated by a semicolon. The body must at some 

point return control to the procedure that called it. All this 

will be discussed more later. 4a3o 

The procedure must end with the terminal statement; ^a^c 

END. 
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bxaraple: 4a4 

FHOiaRAM con:pare 4^4* 

'^ Contert iinaiyier. Displays statement if first two 

visibles are tne sap.e. % 

L'ECLAEE TEXT POINTEK ptl, pt^, pt3* ptu; ^^ireserves 

space for ("ueclares") lOur 
text pointers naneo "ptl" 
through "ptli"% 
DECLAP^ STkING visi/'ioo;, vis2/100y; '/<>reserve» luo 

Characters 01 space ±or each 
Of two string variables named 
"visi" and "vis2"//c 
(compare) PKOC£ijURE ; 

lif' FINE SNP Tptl 13pT,rpt2 ^NP tpt3 1*PT Tptl^ TriJtN 

%set pointers around first 
twc visiDles (strings of 
printng characters) >*» 
BEGIN itif it found two visioles'/b 

^^visl*- «- ptl pt2 ; %put visibles in strings* 
*vis2^* «• pt3 ptli ; 

IF *visl# = *vis2<^ THEj^ Hi::HJitN(Akub) ; ^compare 

contents 01 strings, return 
and display the stateiaent 
if Identical* 

RETUFN {FALiiiLi ; ^Otherwise, return ana don't 

displays 
End. 
FINISH 

Declaration Statenents 4ai? 

ka you may have guessed fron tne above example, Content 

Analyzer piograws can deal with variaoles (liKe text pointers 

and strings), while patterns cannot. 4a3a 

Text pointers 4ai>o 

A text pointer points to a particular location within an Nj-S 
statement (or into a string, as aescrioed later). 

The text pointer points between two characters in a 
statement. H^ putting the pointers oetween characters, a 
single pointer can be used to mark ooth the end of one 
•string and tne beginning cf the string starting with tne 
next character. 



page l( 



LIO Users* Guide AKU 24426 i^ev. t WQV 74 

Part Two: Content Anaxyzer Programs 



Text pointers are c5eclare<3 with the ioxlowing Declaration 
statement: 

DF.CLARE T£XT POINTER name ; 

Strings 4a>c 

String variables hold text. When they are declared, the 
maximum number of characters is set, 

TO declare a string: 

D&CLAKE STRING name/num; ; 

num is the maximum number of characters allowed for tne 
string, 

e.g. DiiCLAfiE STRING Istring/'lOO; ; 

declares a string named "istring" with a maximum 
length of 100 characters and a current length of u 
Characters {it's empty). 

you can refer to the contents of a string variable t^y 
surrounding the name with asterisks, 

e.g. #istring» is the string stored in the 

variable named "istring". 

You can put the text between two text pointers in a string 
variable wxth the LIO statement: 

♦istring* ♦• ptrl ptr2 5 

Where ptrl and ptr2 are the names of previously declared 
and set text pointers, and Istring is a previously 
declared string variable. 

These variables will retain their value frow one statement to 

the next, other types of variables and their use will be 

discussed in detail in Part Three, Section 3. iiA^a 

Body of the Procedure 4^6 

RETURN Statement 4a6a 

NO matter what it does, every procedure must return control 
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to tne proceuure that called it. The statement wnich aces 
this is t^e i^E'iUKN statement, 

e.g. RtTURN; 

A i^hTUnN Statement may pass values to tne procedure that 
called it, The values nust ce enclosea in parentiiesea after 
the wor'-J HiTURN. 

e.g. PETUi^K (1,23,11.7); 

A content ^inalyzer program must return eitner a value oi 
TkUi; or of FALSL. If it returns the value TKUb (i), the 
statement will oe printed; if it returns fALiii (0), the 
statement will not fce printed. 

I.e. RiiTUKiN (TRUi;); will print tue statement 

Kfi-TUKN (FALSi:); will not print tne statement 

The KETUPN statement often is at tne eno of a procedure, Dut 
it need nox. oe. For example, m tne laiddie of the procedure 
you may Vvant to either hetUkI^ or go on oepenaing on the 
result of a test. 

Other than the requirement of a H£TURN statement, the body of 
tne procedure is entirely a function of tne purpose of the 
procedure. A few of the many pcssiole statements wij.1 pe 
described here; others will oe introduced in Part Three of thid 
document. i+aeu 

FIND Statement 4acc 

one of the nost useful statements for Content Analyzer 
proBiraias is the FIND statement. The FINl/ statement 
specifies a concent Analyzer pattern to be tested against 
the statement, and text pointers to oe manipulated and set, 
starting fror.j the Current Character position (that invisible 
marker refered to in Section 1), If the test succeeds, Lhe 
character position is moved past the last character read. 
If the test fails, the character position is left at the 
position prior to the FIND statement and the values of ail 
text pointers set wiuhin the statement will be reset. 

FIND pattern ; 

The Current Character Position is initialized to bSFOKi- 1H& 
FIPST CHARACTEit, and the scan direction is initialized to 
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left to RIGHT, FUR EACH NEw bTATEHENT passed to tne Content. 
Analyzer prograin. 

Any simple Content Analyzer pattern (as descrioe above) is 
valid in a FIND statement, xn addition, the following 
elements can be incorporated in the pattern: 

#stringname<^ 

the contents of the string variable 

tptr 

store current scan position into the text pointer 
specified by ptr, the naiae of a declared text pointer 

^NUM ptr 

back up tne specified text pointer by the specifiea 
number (NUh) of characters, if NUM is not specified, 
1 Will be assumed. aacKup is xn the direction 
opposite to the current scan direction. 



ptr 



Set current character position to this position, ptr 
is the narae of a previously set text pointer. 



sr{ptr) 



The Current Character position is set to the front 01 
the statement in which tne text pointer ptr is set ana 
scan direction is set froiir left to right, 

S£(ptr) 

The Current Character position is set to the end of 
the statement m which the text pointer ptr is set and 
scan direction is set from right to left. 

BETWEEN ptr ptr (pattern) 

Search limited to between positions specified, ptr is 
a previously set text pointer; tne two must be in the 
same statement or string. Current Character Position 
is set to first position before the pattern is tested. 

e.g. bETWEEN ptl pt2 (2D /.; #NP) 
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Bli^Ds may oe usea as expressions as weiX as iree-stanuing 
statements, l£ used as an e;ipression, ior example in li? 
statenents, it iias the value Th'Ol if ail pattern elements 
within it are true and tne value FALSb li any one of the 
elements is false, 

e.g. IF Flwjij pattern '£hEH ... ; 

complicates example: 

IF FIND tsf siiJP M ^(LD/'-) •) I". " ♦str^ty Shisf) ^nP 
'. THEN RKTUBWlTPUE) ELSE RhTURN (FALiJE) i 

li citatement 4.a6a 

IF causes execution of a statement if a tested expression is 
TRUE. If it is FALSi* and the optional ELSfci part is present, 
the statement foiloving the tiSF- ia executea. Control tnen 
passes to Lhe statement immediately following the IF 

statement. 

It testfexp 'irijiiN statement ; 

IF testexp TriiiN statei'ientl ELSE statements ; 

The statements 'Within the IF statement can oe any valia *.10 

statement, but are not followed uy the usual semicolon; the 

whcie IF sLatenent is treated liKe one statement and 
followed by the semicolon, 

e,g. 

IF FIND I'^Lj VHEN RETURfUi? AISE) ELSE HETUHiUTRUE) ; 

ProKramming Style: File Structure nAf 

You may remember that the compiler whicu converts your NLi> text 

to coae ignores file structure. This allows you to use 

structure to make your prograj.i text easier to read and 

understand. Logical use of structure often facilitates tne 

actual programming task as well, some conventions have 

developed at aUQ in this respect. All of tnese should seem 

obvious and logical to you. 4a7a 

All declarations and PROCEDUi<£ statements snould be one 
level below the PROGRAM statement. 



page 'di 



LIO Users' Guiae ARC 21tj;26 Rev. 6 WuV 74 

Part Two; Content Analyzer programs 



All local declarations (not yet aescriued) and code should 
be one level below the PROCEEURE statement. 

It is good style, and nakes for much easier programming, to 
list What you want to do as comment statements (in percent 
signs) at the level below the PRQCi-DUKi!; statement. Then you 
can go tack and iill in the code that accomplishes the tasK 
aescribed in each comment statement. Tne coue should go one 
level below the comment. 

We will later describe how to block a series oi statements 
Where one is required. These blocks should go a level oelow 
the statement of which they are a part. 

File structure should follow the logical structure of the 
program as closely as possible. 

e.g. XF FlHli [Sl^J 

'£Hhh KETURM(TRUl:) 

ELSE RETURN (FALSE) ; 

Using Content Analyzer Programs 4ao 

Once the Content Analyzer prograia has been written (in an wLS 
file), there are two steps in using it. First, the program 
must be "compiled," i.e. translated into machine-readable code; 
the compiled code is "loaded" into a space reserved for user 
programs (the user programs buffer)? secondly, the loadea 
program must be "instituted" as the current Content Analyzer 
program. i^aoa 

There are two ways to compile and load a program: 4aob 

1) You may compile a program and load it into your programs 
buffer all in one operation. The program heaaer statement 
must have the word PROGRAM in it. When the user resets nis 
job or logs off,* the compiled code will disappear. 

First, enter the Programs subsystem with the commana; 

Goto Programs OK 
Then you may compile the program with tne command; 

Compile LIO (user program at) SOURCE OK 
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SOUHUE is x-he adaress of tae phOGRAM statement, 

2) ^ou may compile a program into a xiie and then load xt 
into your tuifer as a separate operation, ine progran can 
then be loaded from the file into your user programs bufter 
at any time without recompiling. The header statement must 
use the word FILE instead of P«OiiRAM. Use the PROGRAHii 
suDsystem corumanU; 

Compile File iat) SuuRGJi (usin^) LxO (to file) FILiLi^iAi-ii; 
OK 

Tne FILiiNAHii: must be the saiae as the program's name, 

Tae code iile is called a R£L (RELocataoie code) file. 
Whenever you wish to load the program coae into the user 
programs buffer, use the frOGkAmS subsystem command: 

Load KiiL (file) FILENAME OA 

Once a compiled program has been loaded (oy eitner route), it 
must be instituted. This is done with the pkogkAks subsystem 
command: 4aoc 

institute Program PROGRAii-NAlii: 

{as) Content (analyzer program) OK 

ihe named program will be instituted as the current Content 
Analyzer program, and any previous program wili be 
demstituted (out will remain m the oufier). 

Again, the prograris in the bufxei' are numbered, the fii'st in 
being number one. You may use the nunuer instead of the 
program's name as a shorthand for PROGKAM-NAi^E, 

To mvoKe the Content Analyzer using whatever program is 

currently instituted, use the viewspec i, j, or k, a^ described 

in Fart one, isecticn k ^3f^k) * iiaou 

Problems lia^y 

Given these few constructs, you should now oe able to write a 

number of useful Content Analyzer programs. Try programming 

the following: 4ay«i 

1) 5how those statements which nave a number somewhere m 
the first ^o characters. 
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2) Show tnose statements wiiere the first visible in the 
fitifcteinent is repeated sonewhere in the statement. 
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Sample solutions: *^*^^^ 

Proclem 1 

PiiOQIJAM number 

DECLARE TEXT POlwTE« ttrl, ptriJ ; 
(number) Pi«OCEiJURE 5 

YIUD tptrl *20CH tptr^ ; 

IJf' FIND BETWEEN ptri ptr2 ( [Dj ) 

IHEN RETURN (TRUE) 

ELSt RETURN (FALSE) i 
END. 
FINISH 

ProDxeifl 2 

PROCiRAH vis 

DECLARE TEaT POINTER ptri, ptr2 i 
DECLARE STRING 3tr/500; ; 
(vis) PROCEDURE ; 

FIND ^NP tptrl l^PT tptr2 ; 
#3tr» <- ptrl ptr2 ; 
IF FINt ptr2 /NP *3tr# NP; 
THEN RETURN (TRUE) 
EL3E RETURN (FALSE) J 
END. 
FINISH 
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Section 2: content Analyzer programs: Moaifymg ^jtatements ko 

Introduction mi 

Content Analyzer programs may edit tne statements as well as 
decide whether or not they are printed. *Aney are very useful 
Where a series of editing operations has to be done tine and 
time again. This section will intnoduce you to these 
capabilities. All these constructs wii oe coverea in detail m 
Part Three, 4Dxa 

A Content Analyzer program has several limitations. It can 

manipulate only one file and it can look at statements only in 

sequential oruer Us they appear in the file). It cannot oacK 

up and re-examine previous statements, nor can it sKip ahead to 

other parts of the file, it cannot interact with the user. 

Part iour provides the tools to overcome these limitations. 4010 

String construction ii.t?2 

Statements and the contents of string variables may be modiiied 

by either of the following two statements: iiOia 

ST ptr <■ strlist ; 

The whole statement in which the text pointer namea "ptr" 
resides will be replaced by the string list (to be 
described in a minute), 

HI ptr ptr <■ fetriist ; 

Tne part of the statement from the first ptr to the 
second ptr will be replaced by the string list. 

ptr way be a previously set text pointer or 5F(ptr) or 
SElptr). 

String variables may also be modified with the string 

assignment statement: 4b2D 

*stringname* «• strlist ; 

Tne string list (strlist) may oe any series of string 

designators, seperated by commas. The string designators may 

be any of the following (other possltoilities to be described 

later) : 4b2c 
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a string constant, e.g. "AdC" or % 

ptr ptr 

ttie text between two text pointers previously set in 
eltiier a vStatement or a atrxng 

*stringrame*- 

a string nai.ie in asterisks, referin^ to tne contents of 
tiie string 

E.g.: k02Q 

ST pi p2 ♦■ <tstring# ; 

or 
ST pi *• SF(pl) pi, string, p2 S£:(p2); 

(Wote: these nave exactly ttie same meanxng.i 

Example: 4d3 

PKOQRAM delsp 4D^a 

% Content analyzer. Deletes aj.1 leading spaces from 

stateinerts. % 

DECLARii TEXT POINTEF ptj Preserves space ior 

("declares") a text pointer 
named "pt"% 
idelsp) PROCiiDuiiii; ; 

IF FIND l^SP tpt ThjlN iiscans over leading spaces, 

tnen seta pointer'/^ 
ST pt <- pt SE(pt); ^replaces statement with text 

ircm pointer to statement ena* 
iiiiTURN (FALSt) ; ^return, don't display anything^ 
END. 
tINIiiH 

More Tnan One Change per Statenent kck 

Part of a text pointer is a character count. This count stays 
the same until the text pointer is again set (to some otner 
position), even though the statement has teen edited. If, for 
example, you have the statement i^oi^a 

abcdefghijklmnopqrstuvwxyz 

and if you have set a pointer between the "d" and the "e'S it 
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Will always point between the fourth and fifth characters in 

the statement, if you then delete the character "a", your 

pointer will be between the "e" and the "f", now the fourth and 

fifth characters, j^or this reason, you probaoiy want to ao a 

series of edits beginning with the last one in the statement 

and working backwards through the statement. 4b4b 

Controlling v^hich statements are Modified kOi> 

In TNLS, the Content Analyzer program will be called for 

commands wMich construct a printout of the file (Print ana 

output). The program will run on every statement for which it 

is called (e.g. every statement in the branch during a print 

Branch command) which pass all the other viewspecs. once you 

have written, compiled, and instituted a program which does 

some editing operation, the Print command is the easiest way to 

run the program on a statement, branch, plex, or group. 4o>a 

In DNLS, the system will call tne Content Analyzer program 
Whenever the display is recreated (e.g. viewspec i and the Jump 
commands), and also for the Output commands. i£ the program 
returns TRUE, it will only run on enough statements to fill tne 
screen. It is safer to have programs that edit the file return 
FALSE. Then When you set viewspec i, it will run on all 
statements from the top of the display on, and when it is aone 
it will display the word "Empty". At that point, change to 
Viewspec d and recreate the display with viewspec f, then all 
statements including the changes will be displayed. You can 
control Which statements are edited with level viewspecs ana 
the branch only (g) or plex only (1) viewspecs. 4030 

After having run your program on a file, you may wish to update 

to permanently incorporate the changes in the file, it is wise 

to Update before you run the program so that, if the program 

does something unexpected, you can Delete Modifications and 

return to a good file. 4bi»c 

Problems » 06 

Try writing the followini; programs: ^^^^ 

1) Remove any invisibles from the end of each statement. 

2) Make the first visible a statement name (surrounded ty 
parentheses) if it is a word (letters and digits). 
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sample solutions: koto 

ProDien 1 

PROCtRAM endinv 

DECLARE TEXT POINTER ptr ; 
(endinv) PKOCEDURxi: ; 

IF ¥1UL tptr o£.(ptr) i^NP Tpcr 
THlN ST ptr «- SF(ptr) ptr ; 
R«;7URM (FALSE) ; 
END, 
FINISH 

ProDxen '/ 

PROGRAM naKename 

DECLiiHit TEXT POINTER ptrl, ptr2 ; 
(piaKenarae) PROCEDUHE ; 

IF FINC SNP tptrl i^LC tp^r2 wp 

THlN ST ptrl «- •(# ptrl ptr2, »), ptrii 5E(ptr2)i 

RiiTURN (FALSE) 
END. 
FliaSH 
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